Minimum aberration is an increasingly popular criterion for comparing
and assessing fractional factorial designs, and few would question
its importance and usefulness nowadays. In the past decade or
so, a great deal of work has been done on minimum aberration and 
its various extensions. This paper develops a general theory of min- 
imum aberration based on a sound statistical principle. Our theory 
provides a unified framework for minimum aberration and further 
extends the existing work in the area. More importantly, the theory 
offers a systematic method that enables experimenters to derive their 
own aberration criteria. Our general theory also brings together two 
seemingly separate research areas: one on minimum aberration de- 
signs and the other on designs with requirement sets. To facilitate 
the design construction, we develop a complementary design theory 
for quite a general class of aberration criteria. As an immediate ap- 
plication, we present some construction results on a weak version of 
this class of criteria. 

1. Introduction. The general problem considered in this paper is how 
to select the "best" fractional factorial designs. In situations where we 
have little or no knowledge about the effects that are potentially impor- 
tant, it is appropriate to select designs using the minimum aberration cri- 
terion [Fries and Hunter (1980)]. Wu and Hamada (2000) contains tables 
of many known minimum aberration designs. Minimum aberration designs 
enjoy some attractive robust properties [Cheng, Steinberg and Sun (1999) 
and Tang and Deng (1999)]. Much work has been done on the construction 
of minimum aberration designs. For details, we refer to Franklin (1984), 
Chen and Wu (1991), Chen (1992), Chen and Hedayat (1996), Tang and Wu 
(1996), Suen, Chen and Wu (1997) and many others. Sitter, Chen and Feder
(1997), Chen and Cheng (1999) and Cheng and Wu (2002) developed aberration
criteria for blocked fractional factorials. A projective geometric approach
to blocking fractional factorials is considered in Mukerjee and Wu
(1999), and blocked fractional factorials with maximum estimation capacity 
are studied by Cheng and Mukerjee (2001). Wu and Zhu (2003) examined 
the use of a minimum aberration criterion for design selection in robust 
parameter design. 

Developing a general theory of minimum aberration is motivated by the 
desire to unify various versions of minimum aberration that have recently 
appeared in the literature. Based on a sound statistical principle, this paper 
develops a general theory of minimum aberration and discusses its various 
applications. In addition to building a unified framework for many of the 
existing aberration criteria, the theory provides a method for deriving other 
aberration criteria that may be more appropriate for given design situations. 
A minimum aberration design can be called a model robust design because 
of its robust properties. A design with a requirement set [Greenfield (1976)] 
is a model specific design since such a design specifies a set of effects to be 
estimated. Our general theory brings together these two seemingly
unrelated classes of designs.

We will focus our discussion on two-level regular fractional factorial de- 
signs. However, most of our arguments are quite general. Section 2 moti- 
vates, introduces and studies a general criterion of minimum aberration and 
discusses its application to blocked fractional factorials, and to fractional 
factorials when some 2-factor interactions are important. Section 3 is de- 
voted to developing a theory of complementary designs for quite a general 
class of aberration criteria, and presents some construction results on weak 
aberration. 

In what follows, we introduce some notation and definitions to set the 
stage for the later development. A regular $2^{m-p}$ design has m factors each
at two levels and $n = 2^{m-p}$ runs, and is completely determined by p
independent defining words. The two levels are denoted by +1 and -1, so the
design matrix D of such a design is an $n \times m$ matrix of $\pm 1$. The
defining relation of a $2^{m-p}$ design is the complete set of defining words.
Labels of factors are referred to as letters. A defining word specifies a set
of letters that has the property that the product of the corresponding columns
of D is a column of all plus ones. Including I, the column of all ones, the
defining relation of a $2^{m-p}$ design has $2^p$ defining words. Let $A_i(D)$
be the number of defining words of length i in the defining relation of design
D, where the length of a word is the number of letters in the word. The
resolution of design D is the integer R such that $A_i(D) = 0$ for
$i = 1, \ldots, R-1$ and $A_R(D) > 0$. The minimum aberration criterion selects
designs that sequentially minimize $A_1(D), \ldots, A_m(D)$. For designs of
resolution at least III, we have $A_1 = A_2 = 0$, so the minimum aberration
criterion selects designs that sequentially minimize $A_3(D), \ldots, A_m(D)$.
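
The definitions above can be illustrated with a minimal sketch, which is not part of the original development. It assumes that defining words are written as strings of factor labels, that the product of two words is their symmetric difference, and that "sequentially minimize" means a lexicographic comparison of $(A_1, \ldots, A_m)$. The function names and the example generators are hypothetical choices made only for illustration.

```python
def defining_relation(generators):
    """All 2^p words generated by p independent defining words.

    Each word is a frozenset of letters; the empty set plays the role of I.
    """
    words = {frozenset()}
    for g in generators:
        g = frozenset(g)
        # Multiplying words corresponds to the symmetric difference of letter sets.
        words |= {w ^ g for w in words}
    return words


def word_length_pattern(generators, m):
    """Return [A_0, A_1, ..., A_m]; A_0 = 1 counts the identity I."""
    pattern = [0] * (m + 1)
    for w in defining_relation(generators):
        pattern[len(w)] += 1
    return pattern


def resolution(pattern):
    """Smallest R >= 1 with A_R > 0."""
    return next(i for i in range(1, len(pattern)) if pattern[i] > 0)


# A 2^(5-2) design with 4 = 12 and 5 = 13, i.e. defining words 124 and 135.
words = defining_relation(["124", "135"])
A = word_length_pattern(["124", "135"], 5)

# Two hypothetical 2^(7-2) designs compared by sequential minimization.
d1 = word_length_pattern(["1236", "1457"], 7)
d2 = word_length_pattern(["1236", "1247"], 7)
better = tuple(d1[1:]) < tuple(d2[1:])  # True when d1 has less aberration
```

Python's tuple ordering is lexicographic, so it matches the sequential minimization in the criterion directly.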

2. General theory of minimum aberration and its applications. 

2.1. A general criterion of minimum aberration. Besides the grand mean
$\gamma_0$, there are in all $2^m - 1$ factorial effects in a $2^{m-p}$ design.
Suppose that out of the $2^m - 1$ effects, we are interested in estimating a
set of effects $\gamma_1$. Then the fitted model is given by

(1)  $Y = \gamma_0 I + W_1\gamma_1 + \varepsilon$,

where Y denotes the vector of n observations, $\gamma_1$ the vector of the
effects to be estimated, $W_1$ the model matrix corresponding to $\gamma_1$ and
$\varepsilon$ the vector of uncorrelated random errors, assumed to have a zero
mean and a constant variance. Because the remaining effects may not be
negligible, we should choose a design that minimizes their contamination on
the estimation of $\gamma_1$, from among all designs allowing estimation of the
model in (1). Suppose that prior knowledge enables us to divide these
remaining effects into J - 1 groups, denoted by $\gamma_2, \ldots, \gamma_J$,
in such a way that the effects in $\gamma_j$ are more important than those in
$\gamma_{j+1}$, for $j = 2, \ldots, J-1$. Then the true model can be written as

(2)  $Y = \gamma_0 I + W_1\gamma_1 + W_2\gamma_2 + \cdots + W_J\gamma_J + \varepsilon$,

where $W_j$ is the model matrix corresponding to $\gamma_j$ for
$j = 1, \ldots, J$. The least-squares solution
$\hat\gamma_1 = (W_1^T W_1)^{-1} W_1^T Y = n^{-1} W_1^T Y$ from the fitted model
in (1) has expectation, taken under the true model in (2),
$E(\hat\gamma_1) = \gamma_1 + C_2\gamma_2 + \cdots + C_J\gamma_J$, where
$C_j = n^{-1} W_1^T W_j$ for $j \ge 2$. So the bias of $\hat\gamma_1$ in
estimating $\gamma_1$ is $C_2\gamma_2 + \cdots + C_J\gamma_J$. Note that
$C_j\gamma_j$ represents the contribution of $\gamma_j$ to the bias. As
$\gamma_j$ is unknown, we will have to work with $C_j$. One size measure for a
matrix $C = (c_{ij})$ is given by
$\|C\|^2 = \mathrm{trace}(C^T C) = \sum_{i,j} c_{ij}^2$. Since the effects in
$\gamma_j$ are more important than those in $\gamma_{j+1}$, to minimize the bias
of $\hat\gamma_1$, heuristically we can sequentially minimize
$\|C_2\|^2, \ldots, \|C_J\|^2$. For regular designs, the entries of $C_j$ are
either 0 or 1, and therefore $N_j = \|C_j\|^2$ is simply the number of effects
in $\gamma_j$ that are aliased with those in $\gamma_1$, for
$j = 2, \ldots, J$. Two effects are aliased (or confounded) with each other if
their corresponding columns in the model matrix are identical.

Definition 1. The general criterion of aberration is defined as the one
that selects designs by sequentially minimizing $N_2, \ldots, N_J$, where $N_j$
is the number of effects in $\gamma_j$ that are aliased with those in
$\gamma_1$, for $j = 2, \ldots, J$.

For convenience, the vector $(N_2, \ldots, N_J)$ is called the word length
pattern with respect to $(\gamma_1, \gamma_2, \ldots, \gamma_J)$. An immediate
application is to the situation where $\gamma_1$ are the main effects and
$\gamma_j$ are the j-factor interactions. In this case, we have

(3)  $N_j = (j+1)A_{j+1} + (m-j+1)A_{j-1}$

for $2 \le j \le m-1$, and $N_m = A_{m-1}$, where $A_j$ is the number of
defining words of length j as introduced in Section 1. The relationship in (3)
leads to the conclusion that sequentially minimizing $N_2, N_3, \ldots$ is
equivalent to sequentially minimizing $A_3, A_4, \ldots$.

Lemma 1. If $\gamma_1$ are the main effects and $\gamma_j$ are the j-factor
interactions, then the general criterion of aberration, given in Definition 1,
is equivalent to the usual criterion of aberration.

The essential result in Lemma 1 was first given by Tang and Deng (1999), 
who in fact presented their result under a more general framework, where 
both regular and nonregular designs are considered. Superficially, Lemma 1 
provides a statistical justification for the usual criterion of aberration, which 
was originally defined from the combinatorial point of view. A message run- 
ning a bit deeper here is that the usual minimum aberration criterion of 
combinatorial nature can in fact be derived from a general theory based on 
a sound statistical principle. 
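
The link between the bias matrices of Section 2.1 and the counting formula (3) can be checked numerically. The following is a minimal sketch under stated assumptions: the design is the small $2^{5-2}$ design with 4 = 12 and 5 = 13, $\gamma_1$ is taken to be the main effects, and all helper names are hypothetical. It computes $N_j = \|C_j\|^2$ directly from $\pm 1$ columns and compares it with $(j+1)A_{j+1} + (m-j+1)A_{j-1}$.

```python
from itertools import combinations, product
from math import prod


def design_matrix(k, generators):
    """Rows of a regular two-level design with k basic factors.

    Each generator is a tuple of 0-based basic-factor indices whose
    product defines one added factor.
    """
    rows = []
    for basic in product((1, -1), repeat=k):
        rows.append(list(basic) + [prod(basic[i] for i in g) for g in generators])
    return rows


def effect_column(rows, letters):
    """Column of the factorial effect indexed by a tuple of factor indices."""
    return tuple(prod(r[i] for i in letters) for r in rows)


def count_words(rows, j):
    """A_j: number of j-letter sets whose column product is all +1."""
    m = len(rows[0])
    return sum(1 for s in combinations(range(m), j)
               if all(v == 1 for v in effect_column(rows, s)))


def n_from_bias(rows, j):
    """N_j = ||C_j||^2 with C_j = W_1^T W_j / n and W_1 the main effects."""
    n, m = len(rows), len(rows[0])
    mains = [effect_column(rows, (i,)) for i in range(m)]
    total = 0.0
    for letters in combinations(range(m), j):
        col = effect_column(rows, letters)
        for w in mains:
            c = sum(a * b for a, b in zip(w, col)) / n  # one entry of C_j
            total += c * c
    return round(total)


rows = design_matrix(3, [(0, 1), (0, 2)])  # 4 = 12, 5 = 13
m = len(rows[0])
by_bias = [n_from_bias(rows, j) for j in range(2, m + 1)]
by_formula = [(j + 1) * count_words(rows, j + 1)
              + (m - j + 1) * count_words(rows, j - 1)
              for j in range(2, m + 1)]
```

For j = m the first term vanishes because no word has m + 1 letters, which reproduces $N_m = A_{m-1}$.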

A more general result than Lemma 1 can easily be obtained. Let $\gamma_1$ be
the main effects and all the interactions involving up to q factors. For the
model in (1) to be estimable, a design of resolution 2q + 1 must exist, which
implies that $A_i = 0$ for $i = 1, \ldots, 2q$. Now let $\gamma_j$ be the
(q - 1 + j)-factor interactions for $j \ge 2$. We can easily show that

(4)  $N_j = \sum_{i=1}^{q} \sum_{k=0}^{i} \binom{q-1+j+i-2k}{i-k} \binom{m-(q-1+j+i-2k)}{k} A_{q-1+j+i-2k}$,

where the term indexed by (i, k) counts the (q - 1 + j)-factor interactions
that are aliased with an i-factor effect with which they share k letters.
Since $A_i = 0$ for $i = 1, \ldots, 2q$, we have

$N_2 = \binom{2q+1}{q} A_{2q+1}$,  $N_3 = \binom{2q+2}{q} A_{2q+2} + \binom{2q+1}{q-1} A_{2q+1}$,

and so on. Noting that the leading term for $N_j$ in (4) is given by
$\binom{2q-1+j}{q} A_{2q-1+j}$, we conclude that sequentially minimizing
$N_2, N_3, \ldots$ is equivalent to sequentially minimizing
$A_{2q+1}, A_{2q+2}, \ldots$. This establishes the following result.
Theorem 1. If $\gamma_1$ are the main effects and all the interactions
involving up to q factors, and $\gamma_j$ are the (q - 1 + j)-factor
interactions for $j \ge 2$, then the general criterion of aberration gives rise
to the usual criterion of aberration that sequentially minimizes
$A_{2q+1}, A_{2q+2}, \ldots$ among all designs of resolution 2q + 1.
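
The counting argument behind Theorem 1 can be checked by brute force on a small case. The sketch below rests on assumptions that should be kept in mind: columns are encoded as bitmasks over GF(2), so a set of columns is a defining word exactly when its XOR is zero; the double-sum expression in `n_formula` is one way of writing the count of aliased pairs by the number k of shared letters, and is offered as an illustration rather than as the authors' exact display; the example is the $2^{8-2}$ design with 7 = 1234 and 8 = 1256, which has resolution V, so q = 2 applies. All function names are hypothetical.

```python
from functools import reduce
from itertools import combinations
from math import comb
from operator import xor


def xor_all(cols):
    return reduce(xor, cols, 0)


def count_words(D, t):
    """A_t: number of t-subsets of columns whose product is the identity."""
    if t < 1 or t > len(D):
        return 0
    return sum(1 for s in combinations(D, t) if xor_all(s) == 0)


def n_formula(D, q, j):
    """Count aliased pairs by word length l = (q-1+j) + i - 2k."""
    m, total = len(D), 0
    for i in range(1, q + 1):
        for k in range(i + 1):
            l = q - 1 + j + i - 2 * k
            if 1 <= l <= m:
                total += comb(l, i - k) * comb(m - l, k) * count_words(D, l)
    return total


def n_brute(D, q, j):
    """Number of (q-1+j)-factor interactions aliased with an effect in gamma_1."""
    gamma1 = {xor_all(s) for i in range(1, q + 1) for s in combinations(D, i)}
    return sum(1 for s in combinations(D, q - 1 + j) if xor_all(s) in gamma1)


# Basic factors 1..6 are bits 1, 2, 4, 8, 16, 32; 7 = 1234 -> 15, 8 = 1256 -> 51.
D = [1, 2, 4, 8, 16, 32, 15, 51]
q = 2
formula = [n_formula(D, q, j) for j in range(2, 6)]
brute = [n_brute(D, q, j) for j in range(2, 6)]
```

Here $A_5 = 2$ and $A_6 = 1$, so the leading term $\binom{5}{2}A_5 = 20$ is the whole of $N_2$.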

2.2. Application to blocked fractional factorials. In addition to m treatment
factors, a blocked fractional factorial contains $m_1$ blocking factors. The
main effects of blocking factors are block effects. So are the interactions of
blocking factors. Therefore, the total number of block effects produced by
$m_1$ blocking factors is $2^{m_1} - 1$.

To avoid confusion, the terms "factor" and "effect" are carefully used 
in this paper. We stick to the meanings of the terms as in the following: a 
factor has a main effect, two factors have a 2-factor interaction (effect), three 
factors have a 3-factor interaction (effect) and so on. We therefore speak of 
$m_1$ blocking factors and $2^{m_1} - 1$ block effects.

A basic requirement for blocked fractional factorials is that all the $2^{m_1} - 1$
block effects should be included in the fitted model. In addition, interac- 
tions between treatment and blocking factors are assumed to be nonexis- 
tent, which is necessary for the effectiveness of blocking. Now consider all 
the treatment effects. To apply the general theory, we need to specify a set 
of treatment effects we want to estimate. Then the fitted model contains 
these treatment effects in addition to all the block effects. In what follows, 
we look at two important special cases. 

The first case is that the main effects of the m treatment factors are in 
the fitted model. Then $\gamma_1$ in model (1) consists of the main effects of
all the m treatment factors and all the $2^{m_1} - 1$ block effects. For the
remaining treatment effects, we assume as usual that the hierarchical ordering
principle applies [Wu and Hamada (2000)], and therefore $\gamma_j$ in model (2)
represents the vector of all the j-factor interactions of treatment factors,
where $j = 2, \ldots, m$.

A defining word in a blocked fractional factorial is a subset of $m + m_1$
letters among which m letters represent treatment factors and $m_1$ letters
represent blocking factors. Let $A_j$ be the number of defining words of length
j that contain no blocking factors, and let $B_j$ be the number of defining
words that contain j treatment factors and at least one blocking factor.
Note that we must have $A_1 = A_2 = B_0 = B_1 = 0$ for the fitted model to be
estimable.

Proposition 1. Let $\gamma_1$ denote all main effects of treatment factors and
all block effects, and let $\gamma_j$ denote all the j-factor interactions of treatment






C.-S. CHENG AND B. TANG 



factors. Then the word length pattern $(N_2, \ldots, N_m)$ is given by
$N_2 = 3A_3 + B_2$, $N_3 = 4A_4 + B_3$, and in general

(5)  $N_j = (j+1)A_{j+1} + (m-j+1)A_{j-1} + B_j$,

where $A_j$ and $B_j$ are defined in the preceding paragraph.

The proof is straightforward. Our general criterion of aberration for blocked 
fractional factorials therefore selects designs by sequentially minimizing
$N_2 = 3A_3 + B_2$, $N_3 = 4A_4 + B_3$ and so on. Chen and Cheng (1999)
proposed a criterion of aberration, and using our notation, their word length
pattern is given by $(3A_3 + B_2, A_4, 10A_5 + B_3, A_6, \ldots)$. We see that
the leading component in their criterion is identical to the leading component
$N_2$ in our general criterion of aberration. Sitter, Chen and Feder (1997)
also proposed a criterion that sequentially minimizes
$A_3, B_2, A_4, B_3, A_5, B_4$, and so on. If the magnitude of $A_{j+1}$ is
about the same as or larger than that of $B_j$, the
criterion of Sitter, Chen and Feder (1997) provides a reasonably good ap- 
proximation to our general criterion. We give an illustration using a simple 
example. 

Example 1. Suppose that we want to study nine factors in 16 runs,
which are to be arranged in two blocks. We use 1, ..., 9 to denote the nine
factors, and b to denote the single blocking factor. Consider the following two
designs. Design $D_1$ is given by 5 = 123, 6 = 124, 7 = 134, 8 = 234, 9 = 12,
and b = 13, and design $D_2$ is given by 5 = 123, 6 = 124, 7 = 134, 8 = 13,
9 = 12, and b = 234. One can easily verify that $A_3(D_1) = 4$ and
$B_2(D_1) = 4$, and $A_3(D_2) = 6$ and $B_2(D_2) = 2$. The criterion of Sitter,
Chen and Feder (1997) selects $D_1$ as a better design because $D_1$ has a
smaller value of $A_3$. Now applying our criterion, we see that
$N_2(D_1) = 16$ and $N_2(D_2) = 20$, and again $D_1$ is better. Note that design
$D_1$ in fact has a larger value of $B_2$ but its smaller value of $A_3$ plays a
dominant role here.
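
The counts quoted in Example 1 can be reproduced with a short brute-force sketch. It assumes a bitmask encoding in which the four basic factors 1, 2, 3, 4 are single bits and a generated column such as 123 is the XOR of its letters; the helper names are hypothetical and the code is only an illustration of the definitions of $A_j$, $B_j$ and $N_2 = 3A_3 + B_2$.

```python
from functools import reduce
from itertools import combinations
from operator import xor


def col(word):
    """Bitmask of a column written as a product of basic factors, e.g. '123'."""
    return reduce(xor, (1 << (int(ch) - 1) for ch in word))


def span(cols):
    """All nonzero GF(2) combinations of the given columns (the block effects)."""
    out = {0}
    for c in cols:
        out |= {x ^ c for x in out}
    return out - {0}


def count_A(treat, j):
    """Words with j treatment letters and no blocking letter."""
    return sum(1 for s in combinations(treat, j) if reduce(xor, s) == 0)


def count_B(treat, block, j):
    """Words with j treatment letters and at least one blocking letter."""
    effects = span(block)
    return sum(1 for s in combinations(treat, j) if reduce(xor, s) in effects)


def summary(treat_words, block_words):
    treat = [col(w) for w in treat_words]
    block = [col(w) for w in block_words]
    a3, b2 = count_A(treat, 3), count_B(treat, block, 2)
    return a3, b2, 3 * a3 + b2  # (A_3, B_2, N_2)


design_1 = summary(["1", "2", "3", "4", "123", "124", "134", "234", "12"], ["13"])
design_2 = summary(["1", "2", "3", "4", "123", "124", "134", "13", "12"], ["234"])
```

With a single blocking factor the set of block effects is just that one column; `span` is included so that the same sketch would cover several blocking factors.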

The other important special case is that we are interested in estimating all 
main effects and all 2-factor interactions of treatment factors. So $\gamma_1$
consists of all the main effects and all the 2-factor interactions of treatment
factors, as well as all the block effects. For $j \ge 2$, $\gamma_j$ is the
vector of all the (j + 1)-factor interactions of the treatment factors. For the
fitted model to be estimable, we must have
$A_1 = A_2 = A_3 = A_4 = B_0 = B_1 = B_2 = 0$. Applying our general
theory, we obtain the following.

Proposition 2. Suppose that $\gamma_1$ consists of all main effects and all
2-factor interactions of treatment factors, as well as all block effects. Let
$\gamma_j$ be the vector of all the (j + 1)-factor interactions of the
treatment factors for $j \ge 2$. Then the word length pattern
$(N_2, N_3, \ldots)$ is given by $N_2 = 10A_5 + B_3$,
$N_3 = 15A_6 + 5A_5 + B_4$, and in general

(6)  $N_j = \binom{j+3}{2} A_{j+3} + (j+2)A_{j+2} + (j+1)(m-j-1)A_{j+1} + (m-j)A_j + \binom{m-j+1}{2} A_{j-1} + B_{j+1}$,

where $A_j$ and $B_j$ are defined as before.

Proposition 2 is easily established by a simple combinatorial argument. 
Comparing our criterion with that of Chen and Cheng (1999), we find that 
the leading component in their criterion becomes $10A_5 + B_3$, which is
precisely the $N_2$ given by our general theory. In fact, we have verified that
the leading component in the word length pattern of Chen and Cheng (1999) is
also correct if in addition to all block effects, the true model consists of all
main effects and all interactions involving up to q factors with $q \ge 3$. One can
therefore appropriately regard the aberration criterion of Chen and Cheng 
(1999) as a robust version of our general aberration criterion when applied 
to blocked fractional factorials. 

Before moving on, we remark that like other work in the area, block 
effects are treated as fixed effects in this paper. Our discussion in this section 
focuses on the situation where we are interested in estimating these block 
effects. If the block effects are not of interest, the contamination on their 
estimation due to nonnegligible treatment effects will not be a concern. Our 
general criterion can easily be modified to accommodate this situation. In 
the meantime, many new issues arise and they will be looked into in the 
future. 

2.3. Fractional factorials when some 2-factor interactions are important. 
Suppose that a set of 2-factor interactions (2fi's) is postulated to be impor- 
tant, and in addition to the main effects, we are also interested in estimating 
these important 2fi's. In this situation the fitted model in (1) consists of all 
main effects and these important 2fi's. For the remaining effects, we assume 
as usual that the hierarchical ordering principle applies. Using the notation 
in Section 2.1, we have that $\gamma_1$ represents the main effects and the
important 2fi's, $\gamma_2$ represents the remaining 2fi's and $\gamma_j$
represents the j-factor interactions for $j \ge 3$.

A 2fi of a fractional factorial D can be represented by an unordered pair
(c, d), where c and d are two columns of D. Let
$(c_1, d_1), \ldots, (c_S, d_S)$ denote the important 2fi's. For each 2fi
$(c_s, d_s)$ where $s = 1, \ldots, S$, let $A_j(c_s, d_s)$ be the number of
length-j words containing both letters $c_s$ and $d_s$, let
$A_j(c_s, \bar d_s)$ be the number of length-j words containing $c_s$ but not
$d_s$, let $A_j(\bar c_s, d_s)$ be the number of length-j words containing
$d_s$ but not $c_s$, and let $A_j(\bar c_s, \bar d_s)$ be the number of
length-j words containing neither $c_s$ nor $d_s$. Obviously,
$A_j = A_j(c_s, d_s) + A_j(c_s, \bar d_s) + A_j(\bar c_s, d_s) + A_j(\bar c_s, \bar d_s)$.
Let

(7)  $A_j^{(12)} = \sum_{s=1}^{S} A_j(c_s, d_s)$,
     $A_j^{(1)} = \sum_{s=1}^{S} [A_j(c_s, \bar d_s) + A_j(\bar c_s, d_s)]$,
     $A_j^{(\overline{12})} = \sum_{s=1}^{S} A_j(\bar c_s, \bar d_s)$.

If a defining word of length j contains more than one pair of letters in the
list of the important 2fi's $(c_1, d_1), \ldots, (c_S, d_S)$, it is counted
more than once in calculating $A_j^{(12)}$. So $A_j^{(12)}$ in fact represents
the total number of times that a defining word of length j contains an
important 2fi $(c_s, d_s)$. Interpretation of $A_j^{(1)}$ and
$A_j^{(\overline{12})}$ is similar.

Proposition 3. When some 2fi's are important, the word length pattern
$(N_2, N_3, \ldots, N_m)$ is given by $N_2 = 3A_3 + A_4^{(12)}$,
$N_3 = 4A_4 + A_5^{(12)} + A_3^{(1)}$ and in general

(8)  $N_j = (j+1)A_{j+1} + (m-j+1)A_{j-1} + A_{j+2}^{(12)} + A_j^{(1)} + A_{j-2}^{(\overline{12})}$,

where $A_{j+2}^{(12)}$, $A_j^{(1)}$ and $A_{j-2}^{(\overline{12})}$ are defined in (7).

The above version of the word length pattern is given in terms of the
defining words of the original design matrix D. We now present another
version in terms of the defining words of the augmented design given by the
model matrix $W_1 = (D, D_2)$, where $W_1$ is as in model (1) and $D_2$
corresponds to the important 2fi's in the fitted model. This latter version is
convenient for developing a general complementary design theory in Section 3.

Consider the words in the defining relation of the augmented design
$W_1 = (D, D_2)$. Let $A_j$ be the number of length-j words having all their j
letters from D, and let $B_j$ be the number of length-(j + 1) words having j
letters from D and one letter from $D_2$.

Proposition 4. When some 2fi's are important, the word length pattern
$(N_2, N_3, \ldots, N_m)$ is given by $N_2 = 3A_3 + B_2 - S$, and

(9)  $N_j = (j+1)A_{j+1} + (m-j+1)A_{j-1} + B_j$

for $j \ge 3$.

In Proposition 4, dependence of $N_j$ on the important 2fi's is expressed
through $B_j$, which depends on matrix $D_2$, given by the columns of the
important 2fi's.

The expression for $N_2$ in Proposition 4 needs a bit of explanation. Let
$(c_s, d_s)$ be an important 2fi, for $s = 1, \ldots, S$. Then the three
columns $c_s$, $d_s$ and $c_s d_s$, where columns $c_s$ and $d_s$ are from D
and column $c_s d_s$ is from $D_2$, form a word of length 3 that contributes to
$B_2$ but not to $N_2$. This explains why we have $N_2 = 3A_3 + B_2 - S$
instead of $N_2 = 3A_3 + B_2$.
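
The role of the correction term S can be seen in a small numerical check of Proposition 4. This is a sketch under assumptions: the design is the $2^{5-2}$ design with 4 = 12 and 5 = 13 encoded as bitmask columns, a single 2fi between the second and third factors is declared important, and the function names are hypothetical. The brute-force count follows Definition 1 directly, while the formula uses $A_j$ and $B_j$ of the augmented design $(D, D_2)$.

```python
from functools import reduce
from itertools import combinations
from operator import xor


def xor_all(cols):
    return reduce(xor, cols, 0)


def n_brute(D, pairs, j):
    """j-factor interactions outside gamma_1 that are aliased with gamma_1."""
    gamma1 = set(D) | {D[a] ^ D[b] for a, b in pairs}
    important = {frozenset(p) for p in pairs}
    count = 0
    for idx in combinations(range(len(D)), j):
        if j == 2 and frozenset(idx) in important:
            continue  # an important 2fi belongs to gamma_1, not gamma_2
        if xor_all(D[i] for i in idx) in gamma1:
            count += 1
    return count


def n_formula(D, pairs, j):
    """Proposition 4: (j+1)A_{j+1} + (m-j+1)A_{j-1} + B_j, minus S when j = 2."""
    m, S = len(D), len(pairs)
    D2 = {D[a] ^ D[b] for a, b in pairs}

    def A(t):
        if t < 1 or t > m:
            return 0
        return sum(1 for s in combinations(D, t) if xor_all(s) == 0)

    B = sum(1 for s in combinations(D, j) if xor_all(s) in D2)
    return (j + 1) * A(j + 1) + (m - j + 1) * A(j - 1) + B - (S if j == 2 else 0)


D = [1, 2, 4, 3, 5]   # factors 1, 2, 3 basic; 4 = 12 -> 3; 5 = 13 -> 5
pairs = [(1, 2)]      # 0-based indices: the 2fi between factors 2 and 3
check = [(n_brute(D, pairs, j), n_formula(D, pairs, j)) for j in (2, 3)]
```

In this example $B_2 = 2$: one of the two words is the trivial word formed by the important 2fi and its own column in $D_2$, which is exactly what subtracting S removes.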

Ke and Tang (2003) examined practical issues in design selection using 
the general criterion of aberration when some 2fi's are important, and pre- 
sented a collection of designs of 16 and 32 runs for models containing up to 
four important 2fi's. 

2.4. Other applications. In robust parameter design, there are two sets 
of factors, control factors and noise factors. The goal of the experiment is 
to choose the settings of control factors so that the response variable is in- 
sensitive to noise factors. Suitable designs should therefore allow analysis 
of both location and dispersion effects. Wu and Zhu (2003) examined the 
use of an aberration criterion for robust parameter design which is mainly 
motivated by the analysis of location effects. It would be interesting to see 
how our general theory can be modified to take into account the analysis of 
dispersion effects. One possibility is to select designs using our criterion from 
among those designs allowing suitable analysis of dispersion effects as can be 
found in Hedayat and Stufken (1999). Our general theory is potentially use- 
ful in fractional factorial split plot designs. Huang, Chen and Voelkel (1998) 
and Bingham and Sitter (1999) considered aberration criteria for split plot 
designs. Since split plot designs have more than one error structure, some 
sort of modification seems necessary for our theory to be applicable to such 
problems. In our future research, both areas of application will be consid- 
ered. 

3. Theory of complementary designs. In this section, we will develop a 
complementary design theory for a class of aberration criteria. This class of 
criteria, to be introduced below, is quite broad, and in particular includes 
as special cases the aberration criteria for blocked fractional factorials and 
for designs when some 2fi's are important as discussed in Sections 2.2 and 
2.3, respectively. 

3.1. A class of aberration criteria. Suppose that besides the main effects
$\gamma_1'$ of m factors, we are also interested in estimating additional S
effects $\gamma_1''$. For convenience, these m factors are called major
factors. In addition to the m major factors, we may have $m_1$ minor factors,
where $m_1 \ge 0$. When $m_1 = 0$, the additional S effects $\gamma_1''$ are a
set of interactions only involving major factors. When $m_1 \ge 1$, the effects
in $\gamma_1''$ are a subset of effects from the collection of all the
following effects: the interactions only involving major factors, the main
effects of minor factors, the interactions only involving minor factors and the
interactions involving both major and minor factors. The $\gamma_1$ in the
fitted model (1) is therefore given by $\gamma_1 = (\gamma_1', \gamma_1'')$.
Let $\gamma_j$ denote the j-factor interactions only involving the major
factors that are not included in $\gamma_1''$, for $j = 2, \ldots, m$. We
assume as earlier that the effects in $\gamma_j$ are more important than those
in $\gamma_{j+1}$, for $j \ge 2$. With $\gamma_1, \ldots, \gamma_m$ defined
above, the true model is now given in (2). A remark on the true model is in
order when there is at least one minor factor, that is, $m_1 \ge 1$. An
implicit assumption made here is that all other effects involving at least one
minor factor besides those in $\gamma_1''$ are nonexistent. The above
formulation is fairly general, and includes as special cases all the situations 
discussed in Sections 2.1-2.3. For example, for blocked fractional factorials, 
we take the treatment factors as the major factors and the blocking factors 
as the minor factors. Choices for major and minor factors are also natu- 
ral for fractional factorials in the row-column setting [Cheng and Mukerjee 
(2003)]. 

We now derive the word length pattern for the above situation. Let $D_1$
be the design matrix corresponding to the main effects $\gamma_1'$ of the m
major factors and let $D_2$ be the matrix corresponding to the additional S
effects $\gamma_1''$. Note that $D_1$ has m columns and $D_2$ has S columns.
The word length pattern will be given in terms of the model matrix
$W = (D_1, D_2)$, specified by its two components $D_1$ and $D_2$. Let
$A_j(D_1)$ be the number of length-j defining words in design $D_1$. Define
$B_j(D_1, D_2)$ to be the number of length-(j + 1) defining words in design
$W = (D_1, D_2)$ which have j letters from $D_1$ and one letter from $D_2$.
Then it is easily established that the word length pattern
$(N_2, \ldots, N_m)$ is given by

$N_j(D_1, D_2) = (j+1)A_{j+1}(D_1) + (m-j+1)A_{j-1}(D_1) + B_j(D_1, D_2) - S_j$

for $j \ge 2$, where $S_j$ is the number of the interactions of j major factors
that are included in $\gamma_1''$. (For an explanation of why $S_j$ is
necessary, see the end of Section 2.3.) Note that $S_j$ is a constant for the
purpose of choosing $D_1$ and $D_2$. For simplicity, ignoring $S_j$, we
redefine the word length pattern $(N_2, \ldots, N_m)$ as

(10)  $N_j(D_1, D_2) = (j+1)A_{j+1}(D_1) + (m-j+1)A_{j-1}(D_1) + B_j(D_1, D_2)$.

The goal here is to choose $D_1$ and $D_2$ by sequentially minimizing
$N_2, N_3, \ldots$.

3.2. A complementary design theory. Let $H_k = (D_1, D_2, D_3)$, where $H_k$
denotes a saturated design of $n = 2^k$ runs and n - 1 factors. Obviously,
it is impossible to completely characterize design pair $(D_1, D_2)$ through
$D_3$ alone. Our complementary design theory to be developed below
characterizes design pair $(D_1, D_2)$ through design pair $(D_2, D_3)$. This
approach is most effective when the number of columns in $D_3$ is smaller than
that in $D_1$.
We need a result from Tang and Wu (1996) and Suen, Chen and Wu
(1997), who developed a complementary design theory for the usual mini- 
mum aberration criterion. The explicit coefficients in Lemma 2 are due to 
Suen, Chen and Wu (1997). 

Lemma 2. Let $H_k = (D, \bar D)$, where D has m factors. Then we have

$A_j(D) = \sum_{i=0}^{j} c_m(i,j) A_i(\bar D)$,

where $c_m(1,j) = c_m(2,j) = 0$,
$c_m(i,j) = (-1)^{j-[(j-i)/2]} \binom{m-2^{k-1}}{[(j-i)/2]}$ for
$3 \le i \le j$, and

$c_m(0,j) = (-1)^{j-[j/2]} \binom{m-2^{k-1}}{[j/2]} + 2^{-k}[P_j(0;m) - P_j(2^{k-1};m)]$,

where $P_j(x;m) = \sum_{s=0}^{j} (-1)^s \binom{x}{s} \binom{m-x}{j-s}$ is a
Krawtchouk polynomial.
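
Lemma 2 can be checked numerically on a small saturated design. The sketch below makes several assumptions explicit: columns of the saturated design with k = 4 are the bitmasks 1 to 15, [x] is read as the integer part of x, binomial coefficients with a negative upper argument are taken in the generalized sense $x(x-1)\cdots(x-r+1)/r!$, and $A_0 = 1$ counts the identity. The six-column design D and all function names are hypothetical choices for illustration.

```python
from fractions import Fraction
from functools import reduce
from itertools import combinations
from math import factorial
from operator import xor


def gbinom(x, r):
    """Generalized binomial coefficient for integer x (possibly negative)."""
    if r < 0:
        return 0
    num = 1
    for t in range(r):
        num *= x - t
    return num // factorial(r)  # exact: r consecutive integers are divisible by r!


def krawtchouk(j, x, m):
    """P_j(x; m) = sum_s (-1)^s C(x, s) C(m - x, j - s)."""
    return sum((-1) ** s * gbinom(x, s) * gbinom(m - x, j - s) for s in range(j + 1))


def coef(m, k, i, j):
    """c_m(i, j) as stated in Lemma 2."""
    half = 2 ** (k - 1)
    if i in (1, 2):
        return Fraction(0)
    r = (j - i) // 2
    value = Fraction((-1) ** (j - r) * gbinom(m - half, r))
    if i == 0:
        value += Fraction(krawtchouk(j, 0, m) - krawtchouk(j, half, m), 2 ** k)
    return value


def count_words(cols, t):
    """A_t of a set of bitmask columns, with A_0 = 1 for the identity."""
    if t == 0:
        return 1
    return sum(1 for s in combinations(cols, t) if reduce(xor, s) == 0)


k = 4
D = [1, 2, 4, 8, 7, 11]                       # a 2^(6-2) design inside H_4
D_bar = sorted(set(range(1, 2 ** k)) - set(D))  # its complementary design
m = len(D)
direct = [count_words(D, j) for j in range(3, m + 1)]
via_complement = [sum(coef(m, k, i, j) * count_words(D_bar, i) for i in range(j + 1))
                  for j in range(3, m + 1)]
```

Exact rational arithmetic via `Fraction` is used because of the factor $2^{-k}$ in $c_m(0,j)$.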

Note that Nj(D±, D 2 ) in (10) depends on design pair (Di,D 2 ). The main 
result of our complementary design theory is contained in the following 
theorem, which expresses Nj(Di,D 2 ) in terms of design pair (D 2 ,D 3 ). 

Theorem 2. The word length pattern in (10) for the class of criteria
discussed in Section 3.1 depends on design pair $(D_2, D_3)$ through

$N_j(D_1, D_2) = \sum_{i=0}^{j+1} [(j+1-S) c_m(i,j+1) + S c_{m+1}(i,j+1)] A_i(D_2 \cup D_3)$
$\qquad - \sum_{i=0}^{j+1} c_{m+1}(i,j+1) E_i(D_2, D_3)$
$\qquad + (m-j+1) \sum_{i=0}^{j-1} c_m(i,j-1) A_i(D_2 \cup D_3)$,

where $E_i(D_2, D_3) = \sum_{p=1}^{i} p E_i^{(p)}(D_2, D_3)$ with
$E_i^{(p)}(D_2, D_3)$ denoting the number of length-i defining words in
$D_2 \cup D_3$ that have exactly p letters from $D_2$.

Proof. Applying Lemma 2 to design $D_1$, we have

(11)  $A_j(D_1) = \sum_{i=0}^{j} c_m(i,j) A_i(D_2 \cup D_3)$.

For any $d \in D_2$, applying Lemma 2 to design $D_1 \cup \{d\}$, we obtain

(12)  $A_j(D_1 \cup \{d\}) = \sum_{i=0}^{j} c_{m+1}(i,j) A_i((D_2 \setminus \{d\}) \cup D_3)$.

Let $B_{j-1}(d, D_1)$ be the number of length-j defining words in design
$(D_1, D_2)$ that contain letter d and j - 1 letters from $D_1$. Clearly, we
have $A_j(D_1 \cup \{d\}) = A_j(D_1) + B_{j-1}(d, D_1)$. Let

(13)  $T_j(d, D_2, D_3) = \sum_{i=0}^{j} c_{m+1}(i,j) A_i((D_2 \setminus \{d\}) \cup D_3)$.

Then (12) can be rewritten as

(14)  $A_j(D_1) + B_{j-1}(d, D_1) = T_j(d, D_2, D_3)$.

Taking summation over all d in $D_2$ on both sides of (14) gives

$\sum_{d \in D_2} [A_j(D_1) + B_{j-1}(d, D_1)] = \sum_{d \in D_2} T_j(d, D_2, D_3)$.

Noting that $D_2$ has S columns and that $B_{j-1}(D_1, D_2)$ defined in
Section 3.1 is equal to $\sum_{d \in D_2} B_{j-1}(d, D_1)$, we obtain
$S A_j(D_1) + B_{j-1}(D_1, D_2) = T_j(D_2, D_3)$, where

(15)  $T_j(D_2, D_3) = \sum_{d \in D_2} T_j(d, D_2, D_3)$.

Therefore, $B_j(D_1, D_2) = T_{j+1}(D_2, D_3) - S A_{j+1}(D_1)$. Substituting
this expression of $B_j(D_1, D_2)$ into (10), we obtain

(16)  $N_j(D_1, D_2) = (j+1-S) A_{j+1}(D_1) + T_{j+1}(D_2, D_3) + (m-j+1) A_{j-1}(D_1)$.

Now let us calculate $T_{j+1}(D_2, D_3)$ in (15). Let $E_i(d, D_2, D_3)$ be the
number of length-i defining words in $(D_2, D_3)$ that contain letter d. We
have $A_i((D_2 \setminus \{d\}) \cup D_3) = A_i(D_2 \cup D_3) - E_i(d, D_2, D_3)$.
Then (13) becomes

$T_{j+1}(d, D_2, D_3) = \sum_{i=0}^{j+1} c_{m+1}(i,j+1) [A_i(D_2 \cup D_3) - E_i(d, D_2, D_3)]$.

Summing both sides over all d in $D_2$, we obtain

(17)  $T_{j+1}(D_2, D_3) = S \sum_{i=0}^{j+1} c_{m+1}(i,j+1) A_i(D_2 \cup D_3) - \sum_{i=0}^{j+1} c_{m+1}(i,j+1) E_i(D_2, D_3)$,

where $E_i(D_2, D_3) = \sum_{d \in D_2} E_i(d, D_2, D_3)$. Note that
$E_i(d, D_2, D_3)$ is the number of length-i words containing letter d in
design $D_2 \cup D_3$. Thus $E_i(D_2, D_3)$ represents the total number of
times length-i words in design $D_2 \cup D_3$ contain a letter in $D_2$.
Therefore

(18)  $E_i(D_2, D_3) = \sum_{p=1}^{i} p E_i^{(p)}(D_2, D_3)$,

where $E_i^{(p)}$ denotes the number of length-i words in $D_2 \cup D_3$ having
exactly p letters from $D_2$. Combining (11) and (16)-(18), we obtain the
result in Theorem 2. □

Chen and Cheng (1999) developed a complementary design theory for 
blocked fractional factorials. Our complementary design theory given in The- 
orem 2 is applicable to a broad class of aberration criteria including all the 
cases discussed in Sections 2.1-2.3. We want to mention that our theory 
does not include their theory as a special case, because our word length pat- 
tern when applied to blocked factorials is not exactly the same as theirs. On 
the other hand, one can adopt our approach to derive their complementary 
design theory. Our approach appears considerably simpler than theirs. 

Zhu (2003) found a relationship between $A_{ij0}$ and $A_{0kl}$, where
$A_{ij0}$ is the number of length-(i + j) words having i letters from $D_1$ and
j letters from $D_2$, and $A_{0kl}$ is the number of length-(k + l) words
having k letters from $D_2$ and l letters from $D_3$. In principle, Theorem 2
is derivable from his result. On the other hand, it is not obvious how one can
obtain the result in Theorem 2, which clearly shows how $N_2$ depends on $D_2$
and $D_3$, from Zhu's rather involved formula that connects $A_{ij0}$ to
$A_{0kl}$.

3.3. Some results on weak aberration. The general criterion of aberration
sequentially minimizes $N_2(D_1, D_2), N_3(D_1, D_2), \ldots$. A weak version
of the criterion is given by minimizing
$N_2(D_1, D_2) = 3A_3(D_1) + B_2(D_1, D_2)$ alone. Using Theorem 2, we find
that

$N_2(D_1, D_2) = \text{constant} - 3A_3(D_2 \cup D_3) + E_3(D_2, D_3)$,

where the constant does not depend on $D_2$ and $D_3$. Noting that

$E_3(D_2, D_3) = E_3^{(1)}(D_2, D_3) + 2E_3^{(2)}(D_2, D_3) + 3E_3^{(3)}(D_2, D_3)$,

$A_3(D_2 \cup D_3) = A_3(D_3) + E_3^{(1)}(D_2, D_3) + E_3^{(2)}(D_2, D_3) + E_3^{(3)}(D_2, D_3)$,

we have that

$N_2(D_1, D_2) = \text{constant} - 3A_3(D_3) - 2E_3^{(1)}(D_2, D_3) - E_3^{(2)}(D_2, D_3)$.

So minimizing $N_2(D_1, D_2)$ is equivalent to maximizing

(19)  $g(D_2, D_3) = 3A_3(D_3) + 2E_3^{(1)}(D_2, D_3) + E_3^{(2)}(D_2, D_3)$.

The following lemma gives an upper bound on $g(D_2, D_3)$.
Lemma 3. Let $m_2 = S$ and $m_3 = 2^k - 1 - m - S$ be the numbers of
columns in $D_2$ and $D_3$, respectively. We have that:

(i) $3A_3(D_3) + E_3^{(1)}(D_2, D_3) \le \binom{m_3}{2}$,

(ii) $E_3^{(1)}(D_2, D_3) + E_3^{(2)}(D_2, D_3) \le m_2 m_3 / 2$, and

(iii) $g(D_2, D_3) \le m_3(m_2 + m_3 - 1)/2$.

The upper bound in (iii) is reached if and only if the bounds in (i) and (ii)
are both reached.

Proof. For any two columns c and d in $D_3$, the product cd must belong
to one of $D_1$, $D_2$ or $D_3$. Consider all the $\binom{m_3}{2}$ pairs of
columns in $D_3$. The number of the pairs whose products are in
$D_2 \cup D_3$ is given by $E_3^{(1)}(D_2, D_3) + 3A_3(D_3)$. Therefore

(20)  $E_3^{(1)}(D_2, D_3) + 3A_3(D_3) \le \binom{m_3}{2}$,

which proves part (i) of Lemma 3. Similarly, by considering all the products
cd such that $cd \in D_2 \cup D_3$ where $c \in D_2$ and $d \in D_3$, we obtain

(21)  $2E_3^{(1)}(D_2, D_3) + 2E_3^{(2)}(D_2, D_3) \le m_2 m_3$,

from which Lemma 3(ii) follows. Combining (20) and (21), we obtain

$g(D_2, D_3) \le \binom{m_3}{2} + m_2 m_3 / 2 = m_3(m_2 + m_3 - 1)/2$.

This is Lemma 3(iii). The last statement in Lemma 3 is obvious. □

From the proof of Lemma 3, we see that the bound in (20) is reached if
$cd \in D_2 \cup D_3$ for any two columns $c, d \in D_3$, and that the bound in
(21) is reached if $cd \in D_2 \cup D_3$ for any $c \in D_2$ and any
$d \in D_3$. One structure for $D_2$ and $D_3$ to have these properties is
given as follows. Let $a_1, a_2, \ldots, a_k$ be a set of k independent columns
that generates the saturated design $H_k$ of $n = 2^k$ runs and n - 1 factors.
Now choose $D_2 \cup D_3$ to be $H_r = H(a_1, \ldots, a_r)$, the saturated
design generated by independent columns $a_1, \ldots, a_r$ where
$r = 1, \ldots, k-1$. Note that $D_2$ can be any $m_2 = S$ columns from $H_r$.

Theorem 3. Let $H_r$ be the saturated design generated by r independent
columns $a_1, \ldots, a_r$. Then so long as $D_2 \cup D_3 = H_r$, any design
pair $(D_2, D_3)$ maximizes $g(D_2, D_3)$ in (19). Therefore design pair
$(D_1, D_2)$ has minimum weak aberration, where $D_1$ is given by
$H_k \setminus H_r$.

Recall that in introducing the class of aberration criteria in Section 3.1, 
design $D_1$ corresponds to the main effects of m major factors and design
$D_2$ represents $m_2 = S$ additional effects we are interested in estimating,
which may involve some minor factors. In order for design pair $(D_1, D_2)$
given in Theorem 3 to be a legitimate design, we need to specify $D_2$ in such
a way that it indeed represents the S additional effects. We now look at two
situations. The first is that $D_1$ represents the main effects of m treatment
factors, and $D_2$ are all the $2^{m_1} - 1$ block effects given by $m_1$
blocking factors. Then choosing $D_2 = H_{m_1}$, the saturated design generated
by $a_1, \ldots, a_{m_1}$ where $m_1 \le r$ in Theorem 3, satisfies the
requirement. This characterization for blocked designs was given in Chen and
Cheng (1999). We see that it is now derived from Theorem 3. The second
situation we will look at is that $D_1$ are the main effects of m factors, and
$D_2$ are some 2-factor interactions of the m factors. We illustrate how to
choose $D_2$ through an example.

Example 2. Suppose that we want a 16-run design that allows estimation of
the main effects 1, 2, 3, 4, 5, 6, 7 and 8 of eight factors and the following
2-factor interactions: 12, 13, 24 and 35. Let $a_1, a_2, a_3, a_4$ be four
independent columns. Theorem 3 says that we should choose
$D_2 \cup D_3 = \{a_1, a_2, a_1a_2, a_3, a_3a_1, a_3a_2, a_3a_1a_2\}$. That is,
the eight factors are assigned to the columns in
$D_1 = \{a_4, a_4a_1, a_4a_2, a_4a_1a_2, a_4a_3, a_4a_3a_1, a_4a_3a_2, a_4a_3a_1a_2\}$.
Now assign factor 1 to $a_4$, factor 2 to $a_4a_1$, factor 3 to $a_4a_2$,
factor 4 to $a_4a_3$ and factor 5 to $a_4a_3a_1$. Factors 6, 7 and 8 can be
arbitrarily assigned to the remaining three columns in $D_1$. We have that
$12 = a_1$, $13 = a_2$, $24 = a_1a_3$ and $35 = a_1a_2a_3$. So
$D_2 = \{a_1, a_2, a_1a_3, a_1a_2a_3\}$.
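
The assignment in Example 2 can be verified with a short sketch, given here as an illustration only. It assumes the bitmask encoding $a_1 = 1$, $a_2 = 2$, $a_3 = 4$, $a_4 = 8$, one particular (arbitrary) placement of factors 6, 7 and 8, and hypothetical helper names. It recovers $D_2$ and $D_3$, evaluates $g(D_2, D_3)$ from (19), and compares it with the bound in Lemma 3(iii).

```python
from itertools import combinations

a1, a2, a3, a4 = 1, 2, 4, 8

# Factors 1..5 as in Example 2; factors 6, 7, 8 take the remaining columns of D1.
D1 = [a4, a4 ^ a1, a4 ^ a2, a4 ^ a3, a4 ^ a3 ^ a1,
      a4 ^ a1 ^ a2, a4 ^ a3 ^ a2, a4 ^ a3 ^ a1 ^ a2]
important = [(1, 2), (1, 3), (2, 4), (3, 5)]  # 2fi's 12, 13, 24, 35 (1-based)

D2 = sorted(D1[i - 1] ^ D1[j - 1] for i, j in important)
D3 = sorted(set(range(1, 16)) - set(D1) - set(D2))


def g_value(D2, D3):
    """g = 3 A_3(D3) + 2 E_3^(1) + E_3^(2), as in (19)."""
    s2, s3 = set(D2), set(D3)
    A3 = sum(1 for x, y, z in combinations(D3, 3) if x ^ y ^ z == 0)
    E1 = sum(1 for c, d in combinations(D3, 2) if (c ^ d) in s2)  # one letter from D2
    E2 = sum(1 for c, d in combinations(D2, 2) if (c ^ d) in s3)  # two letters from D2
    return 3 * A3 + 2 * E1 + E2


m2, m3 = len(D2), len(D3)
g = g_value(D2, D3)
bound = m3 * (m2 + m3 - 1) // 2

# N_2 of (10): 3 A_3(D1) + B_2(D1, D2), counted directly from the columns.
A3_D1 = sum(1 for x, y, z in combinations(D1, 3) if x ^ y ^ z == 0)
B2 = sum(1 for c, d in combinations(D1, 2) if (c ^ d) in set(D2))
N2 = 3 * A3_D1 + B2
```

Since $D_2 \cup D_3$ is the saturated design generated by $a_1, a_2, a_3$, every product of two of its columns stays inside it, which is why both bounds (20) and (21) are attained here.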